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CLAIMS 

1. A computer-based method yof drug design based on genetic 
polymorphisms, comprising: / 

obtaining more than one ami/o acid sequence of target proteins 
that are the product of a gene exhibiting genetic polymorphisms, wherein 
the sequences represent different genetic polymorphisms; 

generating 3-dimensional (3-D) protein structural variant models 
from the sequences; and / 

based upon the structures of the 3-D models, designing drug 
candidates, modifying existing drugs, identifying potential drug 
candidates or identifying modifications of existing drugs based on 
predicted intermofecular interactions of the drug candidates or modified 
drugs with the/structural variants. 

2. The method of claim 1, wherein the structure-based drug 
design method comprises: 

computationally docking the drug candidate or modified drug 
molecules with the target protein structural variant models; 
energetically refining the docked complexes; 

determining the binding interactions between the drug candidate or 
modified drug molecules and the structural variants; and 

designing and identifying drugs or modifications to existing drugs 
based on the binding interactions. 

3. The method of claim 2/wherein the binding interactions are 
determined by: / 

calculating the free energy of binding between the protein 
structural variant model ancrthe docked molecule; and 

decomposing the total free energy of binding based on the 
interacting residues irythe protein active site. 

4. The method of claim 1 wherein: 

after the protein structural variant models derived from a particular 
genetic polymorphism are generated, selected model structures are 
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analyzed to determine common sjr^jctural features that are conserved 
throughout the selected models, wherein 

the conserved stru<$tural features are used as a basis for structure- 
based drug design stj/aies. 

5. The method of claim 4, wherein the conserved structural 
features are stretches of non-contiguous residues, wherein each stretch 
contains at least two amino acids. 

6. The method of claim 5, wherein the protein is human 
immunodeficiency virus protease. 

7. \ The method of claim 6, wherein the conserved residues 
comprise residues comprise residues 1-9, 25-29, 49-52, 78-81 and 94- 
99; and wherein: 

residue\ 1 is an aliphatic amino acid; residue 2 is a hydrophilic 
amino acid; residue 3 is an aliphatic amino acid; residue 4 is a hydrophilic 
amino acid; residue 5 is a hydrophobic amino acid; residue 6 is an 
aromatic amino acid; residue 7 is a hydrophilic amino acid; residue 8 is a 
basic amino acid\ residue 9 is an aliphatic amino acid; residue 25 is an 
acidic amino acid\ residue 26 is a hydrophobic amino acid; residue 27 is 
an aliphatic aminoWid; residue 28 is an aliphatic amino acid; residue 29 
is an acidic amino acid; residue 49 is an aliphatic amino acid; residue 50 
is a hydrophobic amVio acid; residue 51 is an aliphatic amino acid; residue 
52 is an aliphatic amibo acid; residue 78 is an aliphatic amino acid; 
residue 79 is an alipha\ic amino acid; residue 80 is a hydrophilic amino 
acid; residue 81 is an amDhatic amino acid; residue 94 is an aliphatic 
amino acid; residue 95 is\a thio-containing amino acid; residue 96 is a 
hydrophilic amino acid; residue 97 is hydrophobic amino acid; residue 98 
is hydrophilic amino acid; and residue 99 is an aromatic amino acid. 

8. The method of claim 6, wherein the conserved residues 
comprise residues comprise residues 1-9, 25-29, 49-52, 78-81 and 94- 
99; and wherein: \ 
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residue 1 is proline; residue 2 is glutamine; residue 3 is isoleucine; residue 
4 is threonine; residue 5 is leucine; residue 6 is tryptophan; residue 7 is 
glutamine; residue 8 is arginine; residue 9 is proline; residue 25 is aspartic 
acid; residue 26 is threonine; residue 27 is glycine; residue 28 is alanine; 
residue 29 is aspartic acid; residue 49 is glycine; residue 50 is isoleucine; 
residue 51 is glycine; residue 52 is glycine; residue 78 is glycine; residue 
79 is proline; residue 80 is threonine; residue 81 is proline; residue 94 is 
glycine; residue 95 is cysteine; residue 96 is threonine; residue 97 is 
leucine; residue 98 is asparagine; and residue 99 is phenylalanine. 

9. The method of claim 6, wherein the HIV protease has the 
sequence of amino acids set forth in any of SEQ ID Nos. 3-74 and 77- 
117. 

10. The method of claim 9, wherein the residues comprise residues 
1-9, 25-29, 49-52, 78-81 and 94-99. 

The method of claim 1, wherein the selected model 
\^ structures represent the structural variants resulting from the most 
commorHy occurring genetic polymorphisms. 

*yf The\method of claim 1 , wherein the selected model 



£\structures represent the structural variants resulting from genetic 
polymorphisms fou\d in a selected patient subpopulation. 



The meihod of claim 1 wherein the structural variant models 
are stored in a relational database, comprising: 

3-D molecular coordinates for the structural variants; 
a molecular graphics interface for 3-D molecular structure 
visualization; computer functionality for protein sequence and 
structural analyses; and 

database searching tools. ^ 
'^2^ The method of claim ^2", wherein the database further 
comprises one or more of observed clinical data associated with the 
genetic polymorphisms, subject medical history and subject history. 
2fi ) The method of/claim 1, wherein: 



after generating the 3-D p^tein structural variant models, 
the method comprises: 

computationally doc|<fng drug molecules with the target 
protein models; and 

energetically raining the docked complexes; and 
wherein the candidate drugs are specific for a protein with a 
selected polymorph]*™ or specifically interact with all proteins exhibiting a 
polymorphism. f 

The method of claim >4? wherein the structure-based drug 
design method comprises: 

computationally docking drug or potential new drug candidate 
molecules with the target protein structural variant models; 
energetically refining the docked complexes; 

determining the binding interactions between the drug or potential 
new drug candidate molecules and the structural variants; and 

designing potential new drugs or modifications to existing drugs 
based on the binding interactions. ^ 

The method of claim to, wherein the binding interactions are 
determined by: 

calculating the free energy of binding between the protein 
structural variant model and the docked molecule; and 

decomposing the total free energy of binding based on the 
interacting residues in the protein active site. 

The method of claim lA^'wherein: 

after the protein structural variant models derived from a particular 
genetic polymorphism are generated, selected model structures are 
analyzed to determine common structural features that are conserved 
throughout the selected models; and 

the conserved structural features are used as a basis for structure- 
based drug design studies. 
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y&. The method of claim wherein the selected model 
structures represent the structural variants resulting from the most 
commonly occurring genetic polymorphisms. 
c3V "$$C ~^he metnoc ' °f claim ^/h, wherein the selected model 
structure* represent the structural variants resulting from genetic 
polymorphisms found in a specific patient subpopulation. 

Ij^f. The method of claim wherein the selected model 
structures represent structural variants derived from patients the receive a 
$\ specific treatment reginipn. _ 

. The methodW claim 12", wherein the selected model 
structures represent structural variants derived from patients that exhibit 
a particular clinical responses\to a given drug. 

The method of claim 1^?, wherein the selected model 
structures represent structural variants derived based on the duration of a 
particular drug treatment. ^ 

, The method of claim 1^, wherein the structural variant 
c,u^ m °dels are scored in a relational database, comprising: 

3-D molecular coordinates for the structural variants; 
a molecular graphics interface for 3-D molecular structure 
visualization; and 

functionalit^Vfor protein sequence and structural analysis; and 
database searching tools. _ 

The method of claim Vz, wherein the database further 
comprises observed clinical data associated with the genetic 
polymorphisms, subject medical history and subject history. 

-rfSfc Ac cVnputer-based method of selecting drug therapies for 
patients based on genetic polymorphisms, comprising: 
I obtaining amino acid sequences of a target protein that is the 

£ \ product of a gene exhibiting genetic polymorphisms, wherein the 
sequences represent different genetic polymorphisms; 
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generating >3-D protein structural variant models from the 
sequences; 

computationally docking drug molecules with the target protein 
models; 

energetically reVfining the docked complexes; 
determining thelbinding interactions between the drug or potential 
new drug candidate mtflBqules and the models; and 

selecting drujg therapies based on the drug or drugs that have the 
most favorable b'u^qirfgpteractions with the structural variant models. 

tP<Z&. The method of claim £5^vherein the binding interactions are 
determined by: 

calculating the free\energy of binding between the protein 
structural variant and the docked drug molecule; and 

decomposing the total free energy of binding based on the 
interacting residues in the prc\tein active site. 
£ v The method of claim 1 , further after generating the 3-D 

structuralVariant models, exporting some or all of them models into a 
program that computationally docks the models with test compounds to 
assess intermolecular interactions. 

A dpmputer-based method for predicting clinical responses in 
patients based on\genetic polymorphisms, comprising: 

obtaining on\ or more amino acid sequences for a target protein 
that is the product oXa gene exhibiting genetic polymorphisms; 

generating 3-D protein structural variant models from the 
sequences; 

building a relationalVlatabase of protein structural variants derived 
based on genetic polymorprVjsms and observed clinical data associated 
with particular polymorphismsVexhibited in the patients, wherein the 
database comprises: 

3-D molecular coordinates for the structural variant models; 
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ring the 3-D model derived from the subject to the 
the database by: 
;tructures in the database that are similar to the 
subject; and 

pretiictirtg b clinical outcome for the patient based on the 
clinical date asso$iated/with the identified structures. 

A cqmputef^based method for designing therapeutic agents 
that are active against biological targets that have become drug resistant 
due to genetic mutations, comprising: 

obtaining a first 3\D protein structural variant model of a target 
protein against which a given drug has biological activity; 

generating a seconq 3-D protein structural variant model of the 
target in which genetic mutations have occurred and against which the 
same drug is no longer biologically active; 

comparing the structures of the first and second model to identify 
structural differences; and 

performing structure-t ased drug design calculations in order to 
identify new drugs or modifications to the existing drug to bring about 
biological activity against the second model. 
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i compLiter-based method for identifying compensatory 
mutations in targetlprotein, comprising: 

obtaining the amino acid sequence of a target protein containing 
multiple amino pcid mutations that is expressed in a patient, wherein the 
nf th 



drug, including ^he act 
generating! a 3-D 



similarities arising\from 
comparing tfie bit 



determine the effedts o 



ve site, has been structurally characterized; 
structural model of the mutated protein; 
comparing! the structure of the mutated protein with the form of the 
protein that responds t( the drug to identify structural differences and/or 
the mutations; 

logical activities of the drug against both the 



mutated protein anp the form of the protein that responds to the drug to 



the mutations on drug response; and 



identifying the m nations in the protein that affect biological 



mparisons. 

or creating a 3-D structural polymorphism 
ing: 

ore a<nino acid sequences of a target protein 

ibiting a genetic polymorphism, wherein 
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activity based on the cc 

3l*- a, 

relational database 

obtaining one 
that is the product o 
sequences represent 

generating 3-D 
sequences; 

energetically refininb the models; 

evaluating the quality of the models; 

optionally obtaining associated clinical properties or data; and 
inputting the model apd any associated properties and/or data into 

a relational database. , 

The method of claim j*C, wherein after energetically refining 

the models, the models are further refined 
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t^lf Th\ method of claim ?SC, wherein the database comprises 

amino sequences of two or more polymorphic variants. 

■jjj 2A. The method of claim 3*, wherein the database comprises 
amino sequenceslof ten or more polymorphic variants . 

The method of claim 3^^/vherein the database comprises 
amino sequences of about 100 or more polymorphic variants . 

The method of claim 2%, wherein the database comprises 
amino sequemjes ofyjbout 1000 or more polymorphic variants . 

rj^S^C The\method\of claim 3^ therein the database comprises 
amino sequences! ofjpdre than 8000 polymorphic variants. 

A database created by the method of claim ^IT 
The database of claim 38^comprising variant 3-dimensional 
structures of a selected target. ^ 

The database of claim $8 that comprises structures of 
proteases or polymerases. 



The database of claim 38, wherein the proteases are viral 



proteases or polymerase^. \ 



The databasAof claim ^8T, wherein the viral proteases are 
human immunodeficiency virus proteases and the polymerase is a viral 
reverse transcriptase^ &f*>2~ 

The method of claim 3^, wherein quality is assessed by 
computing the normalized residue energies such that if e av is > 1.5 a 
model is further refined until e av is < 1 .5; if e av is < 1.5 a model is 
deposited into the database. 

The method of cjamri 1 , wherein the target is an enzyme. 
Tn e method of claim ^4-, wherein the enzyme is a protease 
or polymerase. 

The method of claim wherein the polymerase is a reverse 
transcriptase. ^ 

Tne metnod oi claim ^4, wherein the target is a protein 
expressed by an infectious agent. 
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The method of claim 44, wherein the target is enzyme 
expressed by a an infectious agent. 

50*^^. The method of claim 4c, wherein the agent is a human 
immunodeficiency virus (HIV). 

^j^€f. A coVnputer system, comprising a database containing data 
representative of the three dimensional structure of polymorphic variants 
of a drug target. 

S29^- Tne sys\em of claim f>8, wherein the target is a cell surface 
receptor or an enzyme. 

5^5=2^ The system of claim Ej^, wherein the enzyme is a protease or 
a polymerase. 

^ A database, tomprising: 

sequences of nucleotides encoding a protein or portions thereof, 
^wherein proteins comprise oolymorphic variants; and the portions encode 
a domain of the\protein that\comprises a site in the protein that binds to a 
drug candidates; ^nd 

the coordinates of 3-)dirlhe<isional (3-D) structures of the encoded 
proteins or portions thereof. \ \ ^ ^ 



1 that is a relational database. 
1 5^lYhat comprises at least 2 



The database oflcla. 
SJtfS^K?. The database oT 
polymorphic variants and the corresponding 3-D structures. 

f^l^}®- The database of clairn\^^that comprises at more than 10, 
more than 100, moreVhan 1000, m&re than 8000, or more than 10,000 
polymorphic variants and. the corresponding 3-D structures. 

The database of claim 5a; wherein the protein is a receptor 
or enzyme from a eukaryotic or prokarvotic organism. 



jjt^gar. The database of claim 56/\wherein the organism is a 
pathogen or a mammal. ^ 

The database of claim ^3, vVherein the organism is a 
pathogen is a virus or bacterium and the mammal is a human. 
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^jjO. The database of claim &3", wherein the protein is a protease 
or a reverse transcriptase. 

b2J$^f- A database, comprising the sequences of nucleotides set 
forth in SEQ ID Nos.b-1 17 that encode HIV protease or the portion of 
HIV reverse transcrmfb^eset forth in each SEQ ID. 

The da\aoiase\of claim 53,^further comprising 3-D structural 
coordinates for a protejVor portion thereof comprising sequences of 
amino acids encoded t3y\each of SEQ ID Nos. 3-1 17. 

\c$fy£ The databasie of claim 54^wherein the protein is HIV 
protease. « ^ 

The database\of claim J»4, wherein the protein is HIV reverse 

\ 

transcriptase. A 

The method of claim 1 , wherein the target protein is a 
eukaryotic or prokaryotic protein. 

The method of claim 1 , wherein the target protein is an 
animal protein, a plant protein or a protein from a pathogen. 
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